Approximations for Monotone and Non-monotone Submodular 
Maximization with Knapsack Constraints* 
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Abstract 



Submodular maximization generalizes many fundamental problems in discrete opti- 
mization, including Max-Cut in directed/undirected graphs, maximum coverage, maximum 
ly-j ■ facility location and marketing over social networks. 

In this paper we consider the problem of maximizing any submodular function subject 
to d knapsack constraints, where d is a fixed constant. We establish a strong relation 
between the discrete problem and its continuous relaxation, obtained through extension 
by expectation of the submodular function. Formally, we show that, for any non-negative 
submodular function, an a-approximation algorithm for the continuous relaxation implies a 



^ ■ randomized (a — e)-approximation algorithm for the discrete problem. We use this relation 

to improve the best known approximation ratio for the problem to 1/4 — e, for any e > 0, 
and to obtain a nearly optimal (1 — e~^ —e)— approximation ratio for the monotone case, for 
any e > 0. We further show that the probabilistic domain defined by a continuous solution 
can be reduced to yield a polynomial size domain, given an oracle for the extension by 
I expectation. This leads to a deterministic version of our technique. 

a^ ' 
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^ ■ 

1 Introduction 

A real- valued function /, whose domain is all the subsets of a universe U, is called submodular 
if, for any S,T C U, 

f{S) + f{T)>f{SuT) + f{SnT). 

The concept of submodularity, which can be viewed as a discrete analog of convexity, plays a 
central role in combinatorial theorems and algorithms (see, e.g., and the references therein, 
and the comprehensive surveys in [lOl [2J1 [19]). Submodular maximization generalizes many 
fundamental problems in discrete optimization, including Max-Cut in directed/undirected 
graphs, maximum coverage, maximum facility location and marketing over social networks 
(see, e.g., [I3]). 

In many settings, including set covering or matroid optimization, the underlying submodu- 
lar functions are monotone, meaning that f{S) < f{T) whenever S C T. In other settings, the 
function f{S) is not necessarily monotone. A classic example of such a submodular function 
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is f{S) = J2e£5{S) ^(6)' where 5{S) is a cut in a graph (or hypergraph) G = (y,E) induced 
by a set of vertices S V, and w{e) is the weight of an edge e C. E. An example for a 
monotone submodular function is fcp : 2^ — t- M, defined on a subset of vertices in bipartite 
graph G = (L, R, E). For any 5 C 1/, fG,p{S) = Yl,veN(S) P^^ where N{S) is the neighborhood 
function (i.e., N{S) is the set of neighbors of S), and > is the profit of v, for any v G R. 
The problem max{fG,p{S)\ \S\ < k} is classical maximum coverage. 

In this paper we consider the following problem of maximizing a non-negative submodular 
set function subject to d knapsack constraints (SUB). Given a d-dimensional budget vector L, 
for some d > 1, and an oracle for a non- negative submodular set function / over a universe U, 
where each element i G C/ is associated with a d-dimensional cost vector c(i), we seek a subset 
of elements 5 C [/ whose total cost is at most L, such that f{S) is maximized. 

There has been extensive work on maximizing submodular monotone functions subject to 
matroid constraint^ For the special case of uniform matroid, i.e., the problem {max/(S') : 

< k}, for some k > 1, Nemhauser et. al showed in [21] that a greedy algorithm yields a 
ratio of 1 — to the optimum. Later works presented greedy algorithms that achieve this 
ratio for other special matroids or for variants of maximum coverage (see, e.g., [H [T5| [231 [5]). 
For a general matroid constraint, Calinescu et al. showed in [2] that a scheme based on solving 
a continuous relaxation of the problem followed by pipage rounding (a technique introduced by 
Ageev and Sviridenko [Ij) achieves the ratio of 1 — for maximizing submodular monotone 
functions that can be expressed as a sum of weighted rank functions of matroids. Subsequently, 
this result was extended by Vondrak }24j to general monotone submodular functions. 

The bound of 1 — e~^ is the best possible for all of the above problems. This follows from 
the lower bound of Nemhauser and Wolsey |20] in the oracle model, and the later result of 
Feige [9] for the specific case of maximum coverage, under the assumption that P ^ NP. 

Other variants of monotone submodular optimization were also considered. In Bansal et 
al. studied the problem of maximizing a monotone submodular function subject to n knapsack 
constraints, for arbitrary n > 1, where each element appears in up to k constraints, and k is 
fixed. The paper presents a |^ and f:^ + o{k) approximations for this problem. Demaine and 
Zadimoghaddam [8] studied bi-criteria approximations for monotone submodular set function 
optimization. 

The problem of maximizing a non-monotone submodular function has been studied as 
well. Feige et al. [10] considered (unconstrained) maximization of a general non-monotone 
submodular function. The paper gives several (randomized and deterministic) approximation 
algorithms, as well as hardness results, also for the special case where the function is symmetric. 

Lee et al. [19] studied the problem of maximizing a general submodular function under 
linear and matroid constraints. They proposed algorithms that achieve approximation ratio 
of 1/5 — e for the problem with d linear constraints and a ratio oi l/{d + 2 + 1/d + e) for d 
matroid constraints, for any fixed integer d > 1. 

Improved lower and upper bounds for non-constrained and constrained submodular max- 
imization were recently derived by Gharan and Vondrak [12]. However, this paper does not 
consider knapsack constraints. 

Several fundamental algorithms for submodular maximization (see, e.g., [Tl [4} [24} I19j) use 
a continuous extension of submodular function, to which we refer as extension by expectation. 
Given a submodular function / : 2^ — > M, we define F : [0, 1]^ — > M. For any y € [0, 1]^, let 

(weighted) matroid is a system of 'independent subsets' of a universe, which satisfies certain hereditary 
and exchange properties [22| . 
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i? C [/ be a random variable such that i R with probabihty t/j (we say that R ^ y). Then 

F{y) = E[f{R)] = (fiR) n 11(1 - yA . 

RCU \ ieR i^R. J 

The general framework of these algorithms is to obtain first a fractional solution for the con- 
tinuous extension, followed by rounding which yields a solution for the discrete problem. 

Using the definition of F, we define the continuous relaxation of our problem called con- 
tinuous SUB. Let P = {y G [0, 1]^| YliieU Vi'^^^) — ^} polytope of the instance, then 
the problem is to find y (z P for which F{y) is maximized. For a G (0, 1], an algorithm A 
yields a-approximation for the continuous problem with respect to a submodular function /, 
if for any assignment of non- negative costs to the elements, and for any non- negative budget, 
A finds a feasible solution for continuous SUB of value at least aO, where O is the value of an 
optimal (integral) solution for SUB with the given costs and budget. 

For some specific families of submodular functions, linear programming can be used to 
derive such approximation algorithms (see e.g [U S]). For monotone submodular functions, 
Vondrak presented in [24] a (1 — — o(l))-approximation algorithm for the continuous prob- 
lem. Subsequently, Lee et al. |19j considered the problem of maximizing any submodular 
function with multiple knapsack constraints and developed a (| — o(l))-approximation algo- 
rithm for the continuous problem; however, noting that the rounding method of [IB]J1 which 
proved useful for monotone functions, cannot be applied in the non- monotone case, a (^ — e)- 
approximation was obtained for the discrete problem, by using simple randomized rounding. 
This gap of approximation ratio between the continuous and the discrete case led us to further 
develop the technique in [18], so that it can be applied also for non- monotone functions. 

1.1 Our Results 

In this paper we establish a strong relation between the problem of maximizing any submodular 
function subject to d knapsack constraints and its continuous relaxation. Formally, we show (in 
Theorem I2.6P that for any non-negative submodular function, an a-approximation algorithm 
for the continuous relaxation implies a randomized {a — e)-approximation algorithm for the 
discrete problem. We use this relation to obtain approximation ratio of 1/4 — e for SUB, for 
any e > 0, thus improving the best known result for the problem, due to Lee et al. [19]. For the 
case where the objective function is monotone, we use this relation to obtain a nearly optimal 
(1 — — e) approximation, for any e > 0. An important consequence of the above relation is 
that for any class of submodular functions, a future improvement of the approximation ratio 
for the continuous problem, to a factor of a, immediately implies an approximation ratio of 
(a — e) for the original instance. 

Our technique applies random sampling on the solution space, using a distribution defined 
by the fractional solution for the problem. In Section [3] we show how to convert a feasible 
solution for the continuous problem to another feasible solution with up to 0(log|[/|) frac- 
tional entries, given an oracle to the extension by expectation. This facilitates the usage 
of exhaustive search instead of sampling, which leads to a deterministic version of our tech- 
nique. Specifically, we obtain a deterministic (1/4 — e)-approximation for general instances 
and (1 — — e)-approximation for instances where the submodular function is monotone. 
For the special case of maximum coverage with d knapsack constraints, that is, SUB where 

^ The paper [18J is a preliminary version of this paper. 
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the objective function is / = f^p for a given bipartite graph G and profits p, this result leads 
to a deterministic (1 — — e)— approximation algorithm, since the extension by expectation 
of /g,p can be deterministically evaluated. Some basic properties of submodular functions are 
given in Appendix El 

1.2 Recent Developments 

Subsequent to our study of maximizing monotone submodular functions subject to multiple 
knapsack constraints [18j, Chekuri et al. ^ showed that, by using a more sophisticated round- 
ing technique, the algorithm in [18] can be applied to derive a (1 — — e)-approximation for 
maximizing a submodular function subject to d knapsack constraints and a matroid constraint. 
Specifically, given a fractional solution for the problem, the authors define a probability dis- 
tribution over the solution space, such that all of elements in the domain of the distribution 
are inside the matroid; these elements also satisfy Chernoff-type concentration bounds, which 
can be used to prove some of the probabilistic claims in [18]. The desired approximation ratio 
is obtained by using the algorithm of [18] with sampling replaced by the above distribution in 
the rounding step. Recently, the same set of authors improved in [7] the bound of (1/4 — e) 
presented here to 0.325. 

2 Maximizing Submodular Functions 

In this section we describe our framework for maximizing a submodular set function subject 
to multiple linear constraints. For short, we call this problem SUB. 

2.1 Preliminaries 

Notation: An essential component in our framework is the distinction between elements by 
their costs. We say that an element i is small if c(i) < e^L; otherwise, the element is big. 

Given a universe U, we call a subset of elements 5 C f/ feasible if the total cost of elements 
in S is bounded by L. We say that S is e-nearly feasible (or nearly feasible, if e is known from 
the context) if the total cost of the elements in S is bounded by (1 + e)L. We refer to f{S) as 
the value of S. Similar to the discrete case, y E [0, 1]*^ is feasible if y G P. 

For any subset T C we define /t : 2^ ^ R+ by /t(5') = f{S U T) - /(T). It is easy to 
verify that if / is a submodular set function then /t is also a submodular set function. Finally, 
for any set C [7, we define Cr{S) = X]jG5 Cr(i), where 1 < r < d, and c{S) = Yli(^s ^(*)- ■'^^^ 
a fractional solution y G [0, 1]*^, we define Cr{y) = J2ieU ^rii) • Ui and c{y) = Y^^^jj c{i) ■ yi. 

Overview: Our algorithm consists of two phases, to which we refer as rounding procedure 
and profit enumeration. The rounding procedure yields an [a — 0(e))-approximation for in- 
stances in which there are no big elements, using an o-approximate solution for the continuous 
problem. It relies heavily on Theorem 12.11 that gives some conditions on the probabilistic do- 
main of solutions; these conditions guarantee that the expected profit of the resulting nearly 
feasible solution is high. This solution is then converted to a feasible one, by using a fixing 
procedure. We first present a randomized version and later show how to derandomize the 
rounding procedure. 

The profit enumeration phase uses enumeration over the most profitable elements in an 
optimal solution; then it reduces a general instance to another instance with no big elements, 



4 



on which we apply the rounding procedure. 

Finally, we combine the above results with an algorithm for the continuous problem (e.g., 
the algorithm of ^24j . or [19j ) to obtain approximation algorithm for SUB. 

2.2 A Probabilistic Theorem 

We first prove a general probabilistic theorem which refers to a slight generalization of our 
problem (called generalized SUB). In addition to the standard input for the problem, there is 
also a collection of subsets C 2*^, such that if T G 7W and 5 C T then S £ Ai. The goal is 
to find a subset S C A4, such that c{S) < L and f{S) is maximized. 

Theorem 2.1 For a given input of generalized SUB, let x be a distribution over AA and D a 
random variable D ~ x> such that 

1. E [f[D)] > O/b, where O is an optimal solution for the given instance. 

2. For anyl<r <d, E[cr{D)] < Lr 

3. For any 1 < r < d, Cr{D) = YlT=i (^r{Dk), where ^ Xk and Di, . . . ,Dm are indepen- 
dent random variables. 

4- For any 1 < k < m and 1 < r < d, it holds that either Cr{Dj.) < e^L^ or Cr{DjS) is fixed. 

Let D' = D if D is e-nearly feasible, and D' = otherwise. Then D' is always e -nearly feasible, 
D' G M, and E[f{D')] > (1 - 0{e))E[f{D)]. 

To prove the results in this section, it suffices to use a special case of Theorem 12.11 (for- 
mulated as our next result). We use this theorem in its full generality in [T7|, in developing 
approximation algorithms for variants of maximum coverage and GAP. 

Lemma 2.2 Let x G [0,1]^ be a feasible fractional solution such that F{x) > 0/5, where O 
is the optimal solution for generalized SUB. Let D Q U be a random set such that D ~ x 
(i.e., for all i G U , i £ R with probability Xi), and let D' be a random set such that D' = D 
if D is £-nearly feasible, and D' = otherwise. Then D' is always e-nearly feasible, and 
E[f{D')]>{l-0{e))F{x). 

Proof of Theorem I2.lt Define an indicator random variable F such that F = 1 ii D is 
e-nearly feasible, and F = otherwise. 
Claim 2.1 Pr[F = 0] < de. 

Proof: For any dimension 1 < r < d, it holds that E[cr{D)] = ^1^=1 -^[^r{Dk)] ^ ^r- Define 
Vr = {k\cr{Dk) is not fixed}. Then, 

m 

Var[cr{D)] = ^ Var[cr{Dk)] < ^ E[cl{Dk)] 

k=l keVr 

m 

< E[cr{Dk)]-e'Lr < e^LrY,E[cr{Dk)] < e'Ll 

keVr k=l 

The first inequality holds since yar[X] < and the second inequality follows from the 

fact that Cr{Dk) < e^Lj. for k £Vr. Recall that, by the Chebyshev-Cantelli inequality, for any 
t > and a random variable Z, 

Pr[Z-E[Z]>t]< 



Var[Z]+t^' 
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Thus, 

Pr [cr{D) > (1 + e)Lr] = Pr [cr{D) - E[cr{D)] > (1 + e)L, - E[cr{D)]] 

< Pr[cr{D)-E[cr{D)]>e-Lr]<^^^=e. 

By the union bound, we have that 

d 



Pr[F = 0] < ^ Pr[cr{D) > (1 + e)Lr] < de. 



r=X 



□ 



For any dimension 1 < r < lei Rr = ^^^^^ , and define R = maXf. Rf. , then R denotes 
the maximal relative deviation of the cost from the r-th entry in the budget vector, where the 
maximum is taken over 1 < r < d. 

Claim 2.2 For any £> 1, 

de^ 

Proof: By the Chebyshev-Cantelli inequality we have that, for any dimension 1 < r < d, 
Pr[Rr >l] = Pr[cr{D) > £ ■ Lr] 

< Pr[Cr{D)-E[Cr{D)]>{i-l)Lr]< 



(^-1)2L2 - (£-1)2' 

and by the union bound, we get that 

de^ 



Pr[R >£]< 



(£-1)2- 



□ 



Claim 2.3 For any integer i > 1, if R < i then 

f{D) < 2dl ■ O. 

Proof: The set D can be partitioned to 2d£ sets Z^i, . . . D2dt such that each of these sets is 
a feasible solution. Hence, f{Di) < O. By Lemma lA.H we have that f{D) < f{Di) + . . . + 
f{D2M) < 2d£0. □ 

Combining the above results we have 
Claim 2.4 E[fiD')] > (1 - Oie))E[f{D)]. 
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Proof: By Claims 12.11 and I2.2[ we have that 



E[f{D)] = E[f{D)\F = l]-Pr[F = l]+E[f{D)\F = 0A{R<2)]-Pr[F = 0A{R<2)] 
+ ^E \^fiD)\ F = A (2^ < i? < 2^^+^) -Pr F = A (2^ < i? < 2^+^ 



< E[f{D)\ F = !]■ Pr[F =1]+ A(fe ■ O + (fe^ • ^ • 



Since the last summation is a constant, and F[/(D)] > 0/2, we have that 
E\F(D)\ < E[f{D)\F = l]Pr [F = 1] + e ■ c ■ E[F{D)], 
where c > is some constant. It follows that 

(1 - 0{e))E[fiD)] < E[f{D)\F = l]-Pr[F= 1]. 
Finally, since D' = D \i F = 1 and D' = Q otherwise, we have that 

E[f(D')\ = E[f{D)\F = 1] . Pr [F = 1] > (1 - 0{e))E[f{D)]. 

□ 

By definition, D' is always e-nearly feasible, and D' ^ M.. This completes the proof of the 
theorem. □ 



2.3 Rounding Instances with No Big Elements 

In this section we present an (a — 0(e))-approximation algorithm for SUB inputs with no 
big elements, given an a-approximate solution for the continuous problem. Inputs with no 
big elements are easier to tackle. Indeed, any nearly feasible solution for such input can be 
converted to a feasible one, with only a small harm to the total value. 

Lemma 2.3 Let S he an e-nearly feasible solution with no big elements, then S can be 
converted in polynomial time to a feasible solution S' C S, such that f{S') > (1 — 0{e)) f{S). 
Proof: In fixing the solution S we handle each dimension separately. For any dimension 
1 < r < d, if Cr{S) < Lr then no modification is needed; otherwise, Cr{S) > Lr- Since all 
elements in S are small, we can partition S into i disjoint subsets Si, S2, ■ ■ ■ , Si such that 
eLr < Cr{Sj) < (e + e^)Lr for any 1 < j < i, where i = Q{e~^). Since the function / is 
submodular, by Lemma fA. 31 we have that f{S) > X]j=i fs\Sj{Sj)- Hence, there exists a value 

j € {1,2 . . . ,i} such that fs\s- (Sj) < = fiS) ■ 0(e) (note that fs\siSj) may be negative). 
Now, CriS \ Sj) < Lr, and f(s \ Sj) > (1 - 0{e))f{S). We repeat this step for ah 1 < r < d 
to obtain a feasible set S' satisfying f{S') > (1 — 0(e)) f{S). □ 

Combined with Theorem 12. H we have the following rounding algorithm. 

Randomized Rounding Algorithm for SUB with No Big Elements 

Input: A SUB instance, a feasible solution x for the continuous problem, with F{x) > 0/5. 

1. Define a random set D ^ x. Let D' = D \i D \s e-nearly feasible, and D' = $ otherwise. 
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2. Convert D' to a feasible set D" as in the proof of Lemma 12.31 and return D" . 

Clearly, the algorithm returns a feasible solution for the problem. By Theorem 12. 1^ 
E[f{D')\ > (1 - 0{e))F{x). By LemmaE^l E[f{D")\ > (1 - 0{e))F{x). Hence, we have 
Lemma 2.4 For any instance of SUB with no big elements, any feasible solution x for the con- 
tinuous problem with F[x) > 0/5 can be converted to a feasible solution for SUB in polynomial 
running time with expected profit at least (1 — 0(e)) • F{x). 

2.4 A Randomized Approximation Algorithm 

Given an instance of SUB and a subset T U, define another instance of SUB, to which we 
refer as the residual problem with respect to T, with / remaining the objective function. The 
budget for the residual problem is L' = L — c{T), and the universe U' consists of all elements 
i G U \ T such that c(i) < e^Z', and all elements in T. Formally, 

U' = TU {i £U\T\ c{i) < e^L'} . 

The new cost of element i is c'{i) = c{i) for any z € U' \ T, and c'{i) = for any i ^ T. It 
follows that there are no big elements in the residual problem. Let S" be a feasible solution for 
the residual problem with respect to T. Then c{S) < c'{S) + c{T) <L' + c{T) = L. Thus, any 
feasible solution for the residual problem is also feasible for the original instance. 
Consider the following algorithm. 

A Randomized Approximation Algorithm for SUB 

Input: A SUB instance and an a-approximation algorithm A for continuous SUB with respect 
to the function /. 

1. For any T C [/ such that \T\<h = \d- e"^] 

(a) Use A to obtain an a-approximate solution x for the continuous residual problem 
with respect to T. 

(b) Use the Randomized Rounding Algorithm of Section 12.31 to convert x to a feasible 
solution S for the residual problem. 

2. Return the best solution found. 

Lemma 2.5 The above approximation algorithm returns an {a — 0(e)) -approximate solution 
for SUB and uses a polynomial number of calls to algorithm A. 

Proof: By Lemma 12.41 each iteration the algorithm finds a feasible solution S for the 
residual problem. Hence, the algorithm always returns a feasible solution for the given SUB 
instance. 

Let O = {ii, . . . , ik} he an optimal solution for the input / (we use O to denote both an op- 
timal sub-collection of elements and the optimal value). For i > 1, let -ftT^ = {ii, . . . , ii}, and as- 
sume that the elements are ordered by their residual profits, i.e., ii = argmaxjg0\^^^_^/i^^_^ {{i})- 

Consider the iteration in which T = K^, and define O' = O Ci U' . The set O' is clearly a 
feasible solution for the residual problem with respect to T. We show a lower bound for f{0'). 
The set R = 0\0' consists of elements in O \ T that are big with respect to the residual 
instance. The total cost of elements in R is bounded by V (since O is a feasible solution), and 
thus \R\ < • d. 
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Since T = Kh, for any j e 0\T it holds that frU) < and we get /r(i?) < 

EjenMij}) < ■ = ef{T) < eO. Thus, fo'{R) < MR) < eO. Since f{0) = 

f{0') + fo'{R) < f{0') + ef{0), we have that f{0') > (1 - e)f{0). 



Thus, in this iteration we get a solution x for the residual problem with F{x) > a(l — 
e)f{0), and the solution S obtained after the rounding satisfies f{S) > (1 — 0{e))af{0). 



We summarize in the next result. 

Theorem 2.6 Let f be a submodular function, and suppose there is a polynomial time a- 
approximation algorithm for the continuous problem with respect to f. Then there is a poly- 
nomial time randomized (q — e) -approximation algorithm for SUB with respect to f , for any 



Since there is a (1/4 — o(l))-approximation algorithm for general instances of continuous SUB 
[19] . we have 

Theorem 2.7 There is a polynomial time randomized (1/4 — e)- approximation algorithm for 
SUB, for any e > 0. 

Since there is a (1 — — o(l)) approximation algorithm for SUB with monotone objective 
function [23] we have 

Theorem 2.8 There is a polynomial time randomized (1 — e~'^ — e)- approximation algorithm 
for SUB with monotone objective function, for any e > 0. 

3 A Deterministic Approximation Algorithm 

In this section we show how the algorithm of Section 12.31 can be derandomized, assuming we 
have an oracle for F, the extension by expectation of /. For some families of submodular 
functions, F can be directly evaluated; for a general function /, F can be evaluated with high 
accuracy by sampling /, as in [24] . 




The main idea is to reduce the number of fractional entries in the fractional solution x, so 
that the number of values a random set D ~ x can get is polynomial in the input size (for a 
fixed value of e). Then, we go over all the possible values, and we are promised to obtain a 
solution of high value. 

A key tool in our derandomization is the pipage rounding technique of Ageev and Sviridenko 
[1]. We give below a brief overview of the technique. For any element i G [/, define the unit 
vector i G {0, 1}^, in which ij = for any j ^ i, and ii = 1. Given a fractional solution x 
for the problem and two elements i,j, such that Xi and xj are both fractional, consider the 
vector function Xij{5) = x + 5i — 5j (Note that Xij{6) is equal to x in all entries except i,j). 
Let S'^ij and (^^j j (for short, and 6~) be the maximal and minimal value of 6 for which 
Xij{d) G [0, 1]^. In both Xij{6~^), Xij{6~), the entry of either i or j is integral. 

Define Ff-{5) = F{xij{5)) over the domain [5~,5^]. The function Ffj is convex (see [3] 
for a detailed proof), thus x' = argmax^^^ ^,(^+)^^. _^(5-)j.F(x) has fewer fractional entries than x, 
and F{x') > F{x). By appropriate selection of such that x' maintains feasibility (in some 
sense), we can repeat the above step to gradually decrease the number of fractional entries. 
We use the technique to prove the next result. 

Lemma 3.1 Let x G [0,1]^^ be a solution having k or less fractional entries (i.e., |{i | < 
Xi < 1}| < k), and c{x) < L for some L. Then x can be converted to a vector x! with at 



□ 



e > 0. 
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most k' = y j fractional entries, such that c{x') < (1 + s)L, and F{x') > F{x), in time 
polynomial in k. 

Proof: Let U' = {i \ < Xi < 1} he the set of all fractional entries. We define a new cost 
function c' over the elements in U. 



C'rii) 



Cr{i) i ^ U' 

e ■ L 

Cr{l) < 



T 2k 
' "(1 + e/2y + e/2y < c,(z) < ^(1 + s/2y+' 



2/c 2/c Qik 

Note that for any i G U', c'{i) < c{i), and 



2' ' 2k 



for all 1 < r < d. The number of different values c'^{i) can get for i £ U' is bounded by ^ 
(since all elements are small, and ln(l + x) > x/2). Hence the number of different values c'{i) 
can get for i G [/' is bounded by k' = ^ ^ M2fc) ^ ^ 

We start with x' = x, and while there are i,j G U' such that and x'j are both fractional 
and c!{i) = c! (j), define S'^ = 5f, ■ ■ and 5~ = ^ ■. Since z and j have the same cost (by c'), 
it holds that c'(Sij((5+)) = c'(x-j((5-)) = c'(x).' If i^*^.(5+) > F(x), then set .x" = Xij(5+), 
otherwise x" = Xjj(5~). In both cases F{x") > F{x') and c![x") = c'(x'). Now, repeat this 
step with x' = x" . Since in each iteration the number of fractional entries in x' decreases, the 
process will terminate (after at most k iterations) with a vector x' such that F{x') > F{x), 
<f{x') = (f{x) < L, and there are no two elements i,j G U' with c'(z) = where x^ and Xj 

are both fractional. Also, for any i ^ U' , the entry x' is integral (since Xj was integral and the 
entry was not modified by the process). Thus, the number of fractional entries in x' is at most 
k'. Now, for any dimension 1 < r < d, 

^i^') = ^4cr{i) + ^XiCr{i) 

< (1 + e/2) . ^ x^ c;(i) + ^ x^ ((1 + 6/2)4(0 + 

= (1 + e/2) . ^ x^ 4(0 + ^ x,^ < (1 + e)Lr. 
ieu ieu' 

This completes the proof. □ 

Using the above lemma, we can reduce the number of fractional entries in x to a number 
that is poly-logarithmic in k. However, the number of values D ~ x remains super-polynomial. 
To reduce further the number of fractional entries, we apply the above step twice, that is, we 
convert x with at most \U\ fractional entries to x' with at most k' = (81n(2|L'"|)/£)''. We can 
then apply the conversion again, to obtain x" with at most k" = 0(log \ U\) fractional entries. 

Lemma 3.2 Given a vector L and a constant e > 0, let x £ [0, 1]^ be a vector satisfying 
c(x) < L. Then x can be converted in time polynomial in \U\ to a vector x' with at most 



10 



k" = 0(log \U\) fractional entries, such that c(x') < (1 + e^L, and F{x') > F{x), 

The next result follows immediately from Lemma [2.2l {O is the value of an optimal solution 
for SUB). 

Lemma 3.3 Given x G [0, l]*^ such that x is a feasible fractional solution with F{x) > 0/5, 
there exists a realization of the random variable D ^ x, such that the solution D is nearly 
feasible, and F{V) > (1 - 0{e))F{x). 

Consider the following rounding algorithm. 

Deterministic Rounding Algorithm for SUB with No Big Elements 

Input: A SUB instance, a feasible solution x for the continuous problem, with F{x) > 0/5. 

1. Define x' = (1 + e)''^ ■ x (note that F{x') > (1 + e)"^ • F{x)). 

2. Convert x' to x" such that x" is fractionally feasible, the number of fractional entries in 
x" is 0(log |f7|), and F{x) > (1 + e)-2F(x") > (1 - e'^ - 0{e))0, as in Lemma[321 

3. Enumerate over all possible realizations of -D ~ x" . For each such realization, if the 
solution T> is e-nearly feasible convert it to a feasible solution V (see Lemma 12. 3p . 
Return the solution with maximum value among the feasible solutions found. 

By Theorem 12.11 the algorithm returns a feasible solution of value at least (1 — 0{e))F{x). 
Also, the running time of the algorithm is polynomial when e is a fixed constant. Replacing 
the randomized rounding step in the algorithm of Section 12.41 with the above Deterministic 
Rounding Algorithm, we get the following result. 

Theorem 3.4 Let f be a submodular function, and assume we have an oracle for F. If there 
is a deterministic polynomial time a- approximation algorithm for the continuous problem with 
respect to f , then there is a polynomial time deterministic [a — e)- approximation algorithm for 
SUB with respect to f , for any e > 0. 

We note that, given an oracle to F, both the algorithms of |24) and [19] for the continuous 
problem are deterministic, thus we get the following. 

Theorem 3.5 Given an oracle for F , there is a polynomial time deterministic (1 — e~^ — e)- 
approximation algorithm for SUB with a monotone function, for any e > 0. 

Theorem 3.6 Given an oracle for F, there is a polynomial time deterministic (1/4 — e)- 
approximation algorithm for SUB for any e > 0. 

For the problem of maximum coverage with d knapsack constraints, i.e., SUB where the 
objective function is / = fcp, for a given bipartite graph G and profits p, the function F can 
be evaluated deterministically (see flj). This yields the following result. 

Theorem 3.7 There is a polynomial time deterministic (1 — — e) -approximation algorithm 
for maximum coverage with d knapsack constraints. 

4 Discussion 

In this paper we established a strong relation between the continuous relaxation of SUB and 
the discrete problem. This relation is nearly optimal and suggests that future research should 
focus on deriving better approximation ratios for the continuous problem. 
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The question whether better rounding exists remains open; namely, is it possible to obtain 
an a— approximation algorithm for SUB, given an a < 1 approximation algorithm for the con- 
tinuous problem? And more specifically, is there a polynomial time (1 — e~^)— approximation 
for SUB with monotone objective function? 

Finally, the running times of our algorithms are exponential in thus rendering them 
impractical. Yet, the hardness results for d-dimensional Knapsack (see, e.g., [14^ ?, [E]), a 
special case of SUB, hint that significant improvements over these running times may be 
impossible. 
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A Basic Properties of Submodular Functions 

In this section we give some simple properties of submodular functions. Recall that / : 2^ — t- M 
is a submodular function if f{S) + f{T) > f{S U T) + /(T n S) for any S,T CU. We define 
fT{S) = f{SUT)-f{T). 

Lemma A.l Let / : 2^ — )■ R 6e a submodular function with /(0) > 0, and let S = SiU S2LI 
. . .U Sk, where Si are disjoint sets. Then 

f{S)>f{Si) + f{S2) + ...f{Sk). 
Proof: By induction on k. For k = 2, since / is a submodular function, we have that 

/(Si) + f{S2) > fiSi u S2) + /(Si n ^2) = f{S) + /(0), 

and since /(0) > 0, we get that f{S) < f{Si) + /(^s). 

For k > 2, using the induction hypothesis twice, we have 

f{S) < fiSi) + f{S2) + ... f{Sk-2) + f{Sk-i U Su) < f{Si) + f{S2) + ... f{Sk). 

□ 
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Lemma A. 2 Let f : 2^ ^ M4. be a submodular function, and let S,Ti,T2 C U such that 
Ti C Ta andSnT2 = 0. Then, fr^iS) < fr, {S). 
Proof: Since / is submodular, 

f{S U Ti) + f{T2) > f{S U Ti U T2) + f{{S U Ti) n T2) = f{S U T2) + f{Ti). 

Hence, MS) < ItAS). □ 

Lemma A. 3 Let f : 2^ ^ M+ be a submodular function, and let S = SiL) S2U ■ ■ - U S^, where 
Si are disjoint sets. Then, 

k 

f{S)>Y.fs\S.iS^). 
1=1 

Proof: We note that 

k 



f{S) = Y.fs,u...us.^ASi). 
1=1 

By Lemma EH for each i > 1, fsiU...us,-ASi) > fs\s,iSi). Hence, 

k 

f{S)>Y,fs\sSS^)■ 



i=l 

□ 
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